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Abstract. Classical and quantum statistical mechanics are cast here in 
the language of projective geometry to provide a unified geometrical frame- 
work for statistical physics. After reviewing the Hilbert space formulation of 
classical statistical thermodynamics, we introduce projective geometry as a 
basis for analysing probabilistic aspects of statistical physics. In particular, 
the specification of a canonical polarity on RP n induces a Riemannian met- 
ric on the state space of statistical mechanics. In the case of the canonical 
ensemble, we show that equilibrium thermal states are determined by the 
Hamiltonian gradient flow with respect to this metric. This flow is concisely 
characterised by the fact that it induces a projective automorphism on the 
state manifold. The measurement problem for thermal systems is studied by 
the introduction of the concept of a random state. The general methodology 
is then extended to include the quantum mechanical dynamics of equilibrium 
thermal states. In this case the relevant state space is complex projective 
space, here regarded as a real manifold endowed with the natural Fubini- 
Study metric. A distinguishing feature of quantum thermal dynamics is the 
inherent multiplicity of thermal trajectories in the state space, associated 
with the nonuniqueness of the infinite temperature state. We are then led 
to formulate a geometric characterisation of the standard KMS-relation often 
considered in the context of C*-algebras. The example of a quantum spin 
one-half particle in heat bath is studied in detail. 

Keywords: Hilbert space geometry, Projective geometry, Equilibrium statisti- 
cal mechanics, Quantum dynamics 

I. INTRODUCTION 

One of the most fascinating advances in the application of modern differential geometry 
is its use in statistical physics, including quantum and statistical mechanics. The purpose 
of this paper is to develop a unified geometrical framework that allows for a natural charac- 
terisation of both of these aspects of statistical physics. 

In quantum mechanics, one typically works with square- integrable wave functions, i.e., 
elements of a complex Hilbert space 7i. This space possesses natural geometrical structures 
induced by its norm. However, in order to seek a compelling axiomatic formulation of quan- 
tum mechanics, it may be reasonable to work with a space of more direct physical relevance 
mill. This is not the Hilbert space 7i itself, but rather the manifold X of "instantaneous 
pure states" [|J, which has the structure of a complex projective space CP n , possibly in- 
finite dimensional, enriched with a Hermitian correspondence, i.e., a complex conjugation 



1 



operation that maps points to hyperplanes in CP n } and vice-versa. Equivalently, we think 
of CP n as being endowed with a natural Riemannian metric, the Fubini-Study metric. 

The space £ is, in fact, the quantum analogue of the classical phase space of mechanical 
systems. Hence, one can interpret the Schrodinger equation as Hamilton's equations on CP n , 
and the equation of motion for a general density matrix can be identified with the Liouville 
equation ||. The advantage of working with the manifold E, rather than the Hilbert space 
of state vectors, above all, is that it can readily accommodate generalisations of quantum 
mechanics 0, including nonlinear relativistic models. Furthermore, the structure of S 
allows for a natural probabilistic interpretation even if the standard linear quantum theory 
is modified. 

As we discuss elsewhere ||, the statistical aspects of quantum measurement can be 
greatly clarified if we shift our view slightly, and regard the Hilbert space 7i of quantum 
mechanics not as a complex Hilbert space, but rather a real Hilbert space endowed with a 
real metric and a compatible complex structure. This would appear to be simply a change in 
formalism while keeping the same underlying physical structure. Indeed this is so, but once 
quantum theory is formulated this way its relation to other aspects of statistical physics 
becomes much more apparent. 

Statistical mechanics, in particular, can also be formulated concisely |F] in terms of the 
geometry of a real Hilbert space TC. This can be seen by taking the square root of the Gibbs 
density function, which maps the space of probability distributions to vectors in a convex 
cone Tt + in TC. In this way, the various probabilistic and statistical operations of statistical 
mechanics can be given a transparent geometric meaning in TC H§J. 

However, it can be argued that even at the classical level of statistical mechanics the 
'true' state space is obtained by identifying all the pure states along the given ray through 
the origin of TC. In this case, the space obtained is essentially the real projective space RP n . 
This is the view we take here, and we shall study properties of thermal states that become 
apparent only when the theory is developed in a fully geometric context. 

The present paper is organised as follows. In Section 2, we review the basics of the 
Hilbert space formulation of statistical mechanics. Since this formulation is perhaps not 
very widely appreciated, we can regard this section as an extended introduction which then 
paves the way to the approach in terms of projective geometry presented later. We begin 
with a brief review of statistical geometry, including the theory of the Fisher-Rao metric on 
the parameter space of a family of probability distributions. The Gibbs distribution when 
viewed in this way can be seen as a curve in Hilbert space, parameterised by the inverse 
temperature, and there is a striking formal resemblance to the Schrodinger equation, even 
though here we are working at a strictly classical level. 

A measurement theory for thermal states is developed by analogy with the standard 
density matrix theory used in quantum mechanics. We are then led to a set of uncertainty 
relations for the measurements of thermodynamic conjugate variables such as energy and 
inverse temperature. We also introduce an alternative approach to the measurement theory 
that is not based upon the density matrix description. Our approach, based on the intro- 
duction of random states, extends naturally also to quantum mechanics, where it can be 
seen to be more appealing in a probabilistic context than standard treatments, and indeed 
reduces to the conventional density matrix approach in special cases. 

In Section 3, we introduce a projective geometric framework for the probabilistic op- 
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erations involved in the representation of the canonical thermal state associated with the 
standard Gibbs measure. Thermal states are shown to lie on a trajectory in the real projec- 
tive space RP n , which is endowed with the natural 'spherical' metric. In this connection we 
find it convenient to develop a number of useful differential geometric results characterising 
projective transformations on the state space RP n . We find that the equilibrium thermal 
trajectories, which are shown to be given by a Hamiltonian gradient flow, generate projective 
automorphisms of the state manifold. 

In Section 4, we then synthesise the approaches outlined in Sections 2 and 3, and consider 
the inter-relationship of the classical thermal state space RP n and the quantum phase space 
CP n , to study the quantum mechanical dynamics of equilibrium thermal states. First we 
examine the quantum state space from the viewpoint of complex algebraic geometry, which 
shows that this space is endowed with a natural Riemannian geometry given by the Fubini- 
Study metric, along with a natural symplectic structure. For thermal physics it is instructive 
to look at quantum mechanics from an entirely 'real' point of view as well, and this approach 
is developed in Section 4.B. 

Our formulation is then compared to the standard KMS-construction |jTD[ for equilibrium 
states. In particular, once we pass to the mixed state description we recover the KMS-state. 
However, our quantum mechanical pure thermal state, which does not obey the KMS- 
condition, can be viewed as a more fundamental construction. In Section 4.D we develop a 
theory of the quantum mechanical microcanonical ensemble, formulated entirely in terms of 
the quantum phase space geometry. This is set up in such a way as to admit generalisations 
to nonlinear quantum theories. Finally, we study more explicitly the case of a quantum 
mechanical spin one-half particle in heat bath. 



II. STATISTICAL STATES IN HILBERT SPACE 

A. Hilbert space geometry 

Let us begin by demonstrating how classical statistical mechanics can be formulated in an 
appealing way by the use of a geometrical formalism appropriate for Hilbert space. Consider 
a real Hilbert space Ti equipped with an inner product g a b- A probability density function 
p(x) can be mapped into H by taking the square-root ip(x) = (p(x)) 1 ^ 2 , which is denoted 
by a vector ip a in 7i. The normalisation condition f(ijj(x)) 2 dx = 1 is written g a bip a ip b = 1, 
indicating that ip a lies on the unit sphere S in Ti. Since a probability density function is 
nonnegative, the image of the map / : p(x) — > ip(x) is the intersection «S+ = S D 7i + of 
S with the convex cone 7i + formed by the totality of quadratically integrable nonnegative 
functions. If we consider the space of all probability distributions as a metric space relative 



to the Hellinger distance [11], then / is an isometric embedding in TC. We call ip a the state 
vector of the corresponding probability density p(x). 

A typical random variable is represented on 7i by a symmetric tensor X ab , whose expec- 
tation in a normalised state ifj a is given by 

e^[x) = x ab r^ b . (i) 

Similarly, the expectation of its square is X ac X£ip a ip b . The variance of X ab in the state ip a 
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is therefore Var^,[X] = X ac X£ip a ip b , where X a b = X a b — g a bE^[X] represents the deviation of 
X a b from its mean in the state ip a . 

We consider now the unit sphere S in 7i, and within this sphere a submanifold M. given 
parametrically by ip a (0), where 9 l (i = 1, • • • , r) are local parameters. In particular, later on 
we have in mind the case where the parameter space spanned by 9 % represents the space of 
coupling constants in statistical mechanics associated with the given physical system. In the 
case of the canonical Gibbs measure there is a single such parameter, corresponding to the 
inverse temperature variable f3 = 1/ksT. We write di for d/d8 l . Then, in local coordinates, 
there is a natural Riemannian metric Qij on the parameter space A4, induced by g a b, given by 
Qij = g a bdiip a dj%l) b . This can be seen as follows. First, note that the squared distance between 
the endpoints of two vectors ip a and i] a in 7i is g a b{ip a ~ ri a )(%p b — r] b ). If both endpoints lie on 
Ai, and r] a is obtained by infinitesimally displacing ip a in A4, i.e., if 1 = ip a + diip a d6 l , then 
the separation ds between the two endpoints on M. is ds 2 = QijdQ l dQi , where Q^ is given as 
above. 

The metric Qij is, up to a conventional, irrelevant factor of four, the so-called Fisher-Rao 
metric on the space of the given family of distributions. The Fisher-Rao metric us usually 
defined in terms of a rather complicated expression involving the covariance matrix of the 
gradient of the log-likelihood function; but here we have a simple, transparent geometrical 
construction. The Fisher-Rao metric is important since it provides a geometrical basis for 
the key links between the statistical and physical aspects of the systems under consideration. 



B. Thermal trajectories 

Now suppose we consider the canonical ensemble of classical statistical mechanics, in the 
case for which the system is characterised by a configuration space and an assignment of 
an energy value for each configuration. The parametrised family of probability distributions 
then takes the form of the Gibbs measure 

p(H, (3) = q(x) exp [-f3H{x) - W{(3)} , (2) 

where the variable x ranges over the configuration space, H(x) represents the energy, W(/3) 
is a normalisation factor, and q{x) determines the distribution at (5 = 0, where f3 is the 
inverse temperature parameter. 

We now formulate a Hilbert space characterisation of this distribution. Taking the 
square- root of p(H, (3), we find that the state vector ip a ([3) in H corresponding to the Gibbs 
distribution @ satisfies the differential equation 

where H a b = H a b — 9abE^[H]. Here the operator H a b in the Hilbert space Ti corresponds to 
the specified Hamiltonian function H(x) appearing in (Q). The solution of this equation can 
be represented as follows: 



= exp 



<f , (4) 
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where W((3) = W((3) - W(0) and q a = i[) a (Q) is the prescribed distribution at /3 = 0. 

Since ip a {f3) respects the normalisation g a b4 ,a "4 )b = 1> f° r each value of the temperature f3 
we find a point on M. in S + . To be more specific, the thermal system can be described as 
follows. Consider a unit sphere S in TC, whose axes label the configurations of the system, 
each of which has a definite energy. We let u\ denote an orthonormal basis in Ti. Here, 
the index k labels all the points in the phase space of the given statistical system. In other 
words, for each point in phase space we have a corresponding basis vector 11% in 7i for some 
value of k. With this choice of basis, a classical thermal state ip a {f3) can be expressed as a 
superposition 

^(J3) = e-^W^e-s^ul , (5) 

k 

where Ej. is the energy for k-ih configuration, and thus exp[W(/3)] = J2k exp(— (3E^) is the 
partition function. We note that the states uf. are, in fact, the energy eigenstates of the 
system, with eigenvalues E^. That is to say, H^u\ = E^u^. The index k in these formulae is 
formal in the sense that the summation may, if appropriate, be replaced by an integration. 
By comparing equations (|j) and (|5|), we find that the initial (J3 = 0) thermal state q a is 

q a = c-^ (0) E«*. ( 6 ) 

k 

which corresponds to the centre point in S + . This relation reflects the fact that all configu- 
rations are equally probable likely at infinite temperature. 

Viewed as a function of (3, the state trajectory ip a (l3) thus commences at the centre 
point q a , and follows a curve on S generated by the Hamiltonian H a b according to (|3]). It is 
interesting to note that the curvature of this trajectory, given by 

K (g) = (H± _ (HT _ 1 m 

arises naturally in a physical characterisation of the accuracy bounds for temperature mea- 
surements. This point is pursued further in Section 2.C below, and in 0. In equation (0) the 
expression (H n ) denotes the n-th central moment of the observable Hat,. Here the curvature 
of the curve if) a ((3), which is necessarily positive, is the square of the 'acceleration' vector 
along the state trajectory ip a ((3), normalised by the square of the velocity vector. 



C. Measurement for thermal states 

Given the thermal trajectory ip a ((3) above, we propose, in the first instance, to con- 
sider measurement and estimation by analogy with the von Neumann approach in quantum 
mechanics. According to this scheme the general state of a thermodynamic system is repre- 
sented by a 'density matrix' p ab which in the present context should be understood to be a 
symmetric, semidefinite matrix with trace unity; that is to say, p ab £,a£,b > for any covector 
£ a , and p ab g a b = 1- Then, for example, we can write 

E p [X] = X abP ab (8) 
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for the expectation of a random variable X ab in the state p ab , and 

Var p [X] = X ab X b c p ac - (X ab p ab ) 2 (9) 

for the variance of X ab in that state. It should be evident that in the case of a pure state, 
for which 

p ab ig 

of the form p ab = ijj a ijj b for some state vector ifj a , these formulae reduce to 
the expressions considered in Section 2. A. 

In particular, let us consider measurements made on a pure equilibrium state ip a {(3). 
Such measurements are characterised by projecting the prescribed state onto the ray in the 
Hilbert space corresponding to a specified point in the phase space. Hence, the probabil- 
ity of observing the fc-th state, when the system is in the pure state ip a , is given by the 
corresponding Boltzmann weight 

Pk = (9a b ru b k ) 2 = e"^ . (10) 

In terms of the density matrix description, the state before measurement is given by the 
degenerate pure state matrix p ab = ip a ifj b , for which the thermal development is 



dp' 



lib 



dp 



-i(i7 c y c + #y c ) , (ii) 



or equivalently dp/df3 = —{H,p}, where {A, B} denotes the symmetric product between 
the operators A and B. After a measurement, p ab takes the form of a mixed state, charac- 
terised by a nondegenerate diagonal density operator for which the diagonal elements are 
the Boltzmann weights p\~. In this state vector reduction picture, the von Neumann entropy 
— Trfplnp] changes from to its maximum value S = (3(H) + Wp, which can be viewed as 
the quantity of information gained from the observation. 

More generally, suppose we consider the measurement of an arbitrary observable X ab in 
the state ip a ((3) in the situation when the spectrum of X ab admits a continuous component. 
In this case, we consider the spectral measure associated with the random variable X ab . 
Then, the probability density for the measurement outcome x is given by the expectation 
p(x,/3) = Yl ab {X 1 x)ip a ip b of the projection operator 

1 f°° 

U a b (X, x) = -= / exp \i\[Xl - xb a b )\ d\ . (12) 



2tt 

In other words, we assign a projection- valued measure IIx(x) on the real line associated 
with each symmetric operator X, so that for a given unit vector ip a , the mapping x G R i— > 
E^\Yi x {x)\ is a probability measure. This measure determines the distribution of values 
obtained when the observable X is measured while the system is in the state ip. 

For a more refined view of the measurement problem we need to take into account some 
ideas from statistical estimation theory. Suppose that we want to make a measurement or 
series of measurements to estimate the value of the parameter characterising a given thermal 
equilibrium state. In this situation the observable we measure is called an 'estimator' for 
the given parameter. We are interested in the case for which the estimator is unbiased in 
the sense that its expectation gives the value of the required parameter. To be specific, we 
consider the case when we estimate the value of the temperature. Let B ab be an unbiased 
estimator for (3, so that along the trajectory ip a (P) we have: 
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aW y . = /? • 13 

As a consequence of this relation and the thermal state equation ([3]), we observe that the 
inverse temperature estimator B and the Hamiltonian H satisfy the 'weak' anticommutation 
relation E^[{B,H}] = — 1 along the state trajectory ip. In statistical terms, this implies 
that these conjugate variables satisfy the covariance relation E^,[BH] — E^,[B]E^[H] = — 1 
along the trajectory 



D. Thermodynamic uncertainty relations 

Equipped with the above definitions, one can easily verify that the variance in estimating 
the inverse temperature parameter (3 can be expressed by the geometrical relation 

Var^B] = ±g ab V a (3V b f3 (14) 

on the unit sphere S, where V a /3 = d(3/dip a is the gradient of the temperature estimate 
(3. The essence of formula ([H]) can be understood as follows. First, recall that j3 is the 
expectation of the estimator B ab in the state ip a (/3). Suppose that the state changes rapidly 
as f3 changes. Then, the variance in estimating j3 is small — indeed, this is given by the 
squared magnitude of the 'functional derivative' of j3 with respect to the state ip a . On the 
other hand, if the state does not change significantly as (3 changes, then the measurement 
outcome of an observable is less conclusive in determining the value of (3. 

The squared length of the gradient vector V a /3 can be expressed as a sum of squares 
of orthogonal components. To this end, we choose a new set of orthogonal basis vectors 
given by the state ip a and its higher derivatives. If we let ipn denote ip a for n = 0, and 
for n > the component of the derivative d n ip a /df3 n orthogonal to the state ip a and its 
lower order derivatives, then our orthonormal vectors are given by ^ = ipnidbcipn^n) 2 
for n = 0, 1, 2, • • •. With this choice of orthonormal vectors, we find that the variance of the 
estimator B satisfies 

~ \ g^M ' (15) 

for any range of the index n. This follows because the squared magnitude of the vector 
| V a /3 = B ab ip b is greater than or equal to the sum of the squares of its projections onto the 
basis vectors given by ip® f° r the specified range of n. 

In particular, for n = 1 we have B a hipi4' b = \ on account of the relation B a hi/j a ip b = 
(3, and gab^i^i = \AH 2 , which follows from the thermal equation (|3|). Therefore, if we 
write Var^S] = Af3 2 , we find for n = 1 that the inequality ( |T5D implies the following 
thermodynamic uncertainty relation: 

A[3 2 AH 2 > 1 , (16) 

valid along the trajectory consisting of the thermal equilibrium states. The variance A 2 j3 
here is to be understood in the sense of estimation theory. That is, although the variable 
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(5 does not actually fluctuate, as should be clear from the definition of canonical ensemble, 
there is nonetheless an inevitable lower bound for the variance of the measurement, given 
by (|16|), if we wish to estimate the value of the heat bath temperature (3. It is worth 
pointing out that the exposition we have given here is consistent with the view put forward 
by Mandelbrot |[L2|| , who should perhaps be credited with first having introduced an element 
of modern statistical reasoning into the long-standing debate on the status of temperature 
fluctuations ]T5|. 



Note that, although we only considered the variance ((B — (B)) 2 ) here, the higher order 
central moments fi n = ((B — (B)) n ) can also be expressed geometrically. This can be seen as 
follows. First, recall that for any observable F ab with E^[F] = f, we have V a / = 2F ab ip b on 
the unit sphere S. Therefore, by letting F = B n , we construct the higher central moments in 
terms of the cosines of the angles between certain gradient vectors, e.g., 4/i 3 = g ab V ' a A*2Vb/3, 
4/^4 = g ab V 0,^2^ 'b^2 — 4//|j an d so on. In particular, the even order moments are expressible 
in terms of combinations of the squared lengths of normal vectors to the surfaces of constant 
central moments of lower order. 



E. Random states 

Let us return to the consideration of measurements on thermal states, which we now 
pursue in greater depth. In doing so we shall introduce the idea of a 'random state', a 
concept that is applicable both in clarifying the measurement problem in statistical physics, 
as well as in providing a useful tool when we consider ensembles. It also turns out that 
the idea of a random state is helpful in the analysis of conceptual problems in quantum 
mechanics. Later on when we consider quantum statistical mechanics, we shall have more 
to say on this. 

Suppose we consider a pure thermal state i^ a ((3) for some value j3 of the inverse temper- 
ature. We know that this state is given by 

= y,pT< , (17) 

k 

where u% is a normalised energy eigenstate with eigenvalue Ek, and pk is the associated Gibbs 
probability. After measurement, it is natural to consider the outcome of the measurement 
to be a random state *& a . Thus we consider \I/ a to be a random variable (indicated by use 
of a bold font) such that the probability for taking a given eigenstate is 

Prob[* a = u a k ] = Pk . (18) 

This way of thinking about the outcome of the measurement process is to some extent 
complementary to the density matrix approach, though in what follows we shall make it 
clear what the relationship is. 

In particular, the expectation of an observable X ab in the random state \I/ a is given by 
averaging over the random states, that is, 

E*[X\ = X ab * a V b . (19) 
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This relation should be interpreted as the specification of a conditional expectation, i.e., the 
conditional expectation of X ab in the random state *& a . Then the associated unconditional 
expectation E[X] = E[Eq,[X]] is given by 

E[X\ = X ab E[^ b ] . (20) 
However, since Prob[^ a = u%] — Pk, it should be evident that 

E[i& a ty b } = p ab , (21) 
where the density matrix p ab is defined by 

P ab = EPk« ■ (22) 

k 

Thus, the unconditional expectation of the random variable X ab is given, as noted earlier, 
by E[X] = X ab p ab . It should be observed, however, that here we are not emphasising the 
role of the density matrix p ab as representing a 'state', but rather its role in summarising 
information relating to the random state *f? a . 

The feature that distinguishes the density matrix in this analysis is that it is fully suf- 
ficient for the characterisation of unconditional statistics relating to the observables and 
states under consideration. This point is clearly illustrated when we calculate the variance 
of a random variable X ab in a random state \P a . Such a situation arises if we want to discuss 
the uncertainties arising in the measurement of an observable X ab for an ensemble. In this 
case the system we have in mind is a large number of identical, independent particles, each 
of which is in a definite energy eigenstate, where the distribution of the energy is given 
according to the Gibbs distribution. One might take this as an elementary model for a 
classical gas. Then the distribution of the ensemble can be described in terms of a random 
state \f ,a . Note that here the interpretation is slightly different from what we had considered 
before (the random outcome of a measurement for an isolated system), though it will be 
appreciated that the relation of these two distinct interpretations is of considerable interest 
for physics and statistical theory alike. 

The conditional variance of the observable X ab in the random state 1 5 ,a is given by 

Var*[X] = X ab X b * a * c - (X ab * a * b ) 2 . (23) 
The average over the different values of then gives us 

£[Var*[X]] = X ab X b p ac - X ab X cd p abcd , (24) 
where p ab is, as before, the density matrix (pT]), and 

p abcd ig a 

certain higher moment of ^ ,a , 

defined by 

p abcd _ E ^aq,b^,c^ _ ^ 

The appearance of this higher order analogue of the density matrix may be surprising, 
though it is indeed a characteristic feature of conditional probability. However, the uncon- 
ditional variance of X is not given simply by the expectation E [Var^[Jf]], but rather (see, 
e.g., [i~4|) by the conditional variance formula 
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Var[X] = £[Var*[X]] + Var[£*[X]] . (26) 

For the second term we have 

Vax[E*[X]] = E[(X ab * a * b ) 2 } - (E[X ab * a * b ]) 2 

= X ab X cd p abcd - (X ab p ab ) 2 , (27) 

which also involves the higher moment p abcd . The terms in fl2"E|) involving p abcd then cancel, 
and we are left with Var[X] = X ab X b p ac — (X ab p ab ) 2 for the unconditional variance, which, as 
indicated earlier, only involves the density matrix. It follows that the random state approach 
does indeed reproduce the earlier density matrix formulation of our theory, though the role 
of the density matrix is somewhat diminished. In other words, whenever conditioning is 
involved, it is the set of totally symmetric tensors 

p ab-cd _ E ^a^b_ ^c^ (28) 

that plays the fundamental role, although it suffices to consider the standard density matrix 
p ab when conditioning is removed. 

All this is worth having in mind later when we turn to quantum statistical mechanics, 
where the considerations we have developed here in a classical context reappear in a new 
light. We want to de-emphasise the role of the density matrix, not because there is anything 
wrong per se with the use of the density matrix in an appropriate context, but rather for two 
practical reasons. First of all, when we want to consider conditioning, exclusive attention on 
the density matrix hampers our thinking, since, as we have indicated, higher moments of the 
random state vector also have a role to play. Second, when we go to consider generalisations 
of quantum mechanics, such as the nonlinear theories of the Kibble- Weinberg type [^,0 , or 
stochastic theories of the type considered by Gisin, Percival, and others the density 



matrix is either an ill formulated concept, or plays a diminished role. We shall return to 
this point for further discussion when we consider quantum statistical mechanics in Section 
4. 



III. STATISTICAL PHASE SPACE 

A. Projective space and probabilities 

To proceed further it will be useful to develop a formalism for the algebraic treatment of 
real projective geometry, with a view to its probabilistic interpretation in the context of clas- 
sical statistical mechanics. Let Z a be coordinates for (n+ l)-dimensional real Hilbert space 
1~L n+l . Later, when we consider quantum theory from a real point of view we shall double 
this dimension. In the Hilbert space description of classical probabilities, the normalisation 
condition is written g ab Z a Z b = 1. However, this normalisation is physically irrelevant since 
the expectation of an arbitrary operator F ab is defined by the ratio 

TP 7 a 7^ 

(F > = ^ ' (29) 
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Therefore, the physical state space is not the Hilbert space 7i, but the space of equivalence 
classes obtained by identifying the points {Z a } and {XZ a } for all A G R — {0}. In this way, 
we 'gauge away' the irrelevant degree of freedom. The resulting space is the real projective 
n-space RP n , the space of rays through the origin in 7i n+1 . Thus, two points X a and Y a in 
j-^n+i are equivalent in RP n if they are proportional, i.e., X^ a Y b ^ = 0. 

The coordinates Z a (excluding Z a = 0) can be used as homogeneous coordinates for 
points of RP n . Clearly Z a and XZ a represent the same point in RP n . In practice one treats 
the homogeneous coordinates as though they define points of 7i n+1 , with the stipulation that 
the allowable operations of projective geometry are those which transform homogeneously 
under the rescaling Z a — ► XZ a . 

A prime, or (n — l)-plane in RP n consists of a set of points Z a which satisfy a linear 
equation P a Z a = 0, where we call P a the homogeneous coordinates of the prime. Clearly, 
P a and XP a determine the same prime. Therefore, a prime in RP n is an RP n ~ l , and the 
set of all primes in RP n is itself an RP n (the 'dual' projective space). In particular, the 
metric g ab on 7i n+1 can be interpreted in the projective space as giving rise to a nonsingular 
polarity, that is, an invertible map from points to hyperplanes in RP n of codimension one. 
This map is given by P a — > P a := g a bP h ■ See reference |1| for further discussion of some of 
the geometric operations employed here. 

If a point P a in RP n corresponds to a probability state, then its negation -iP a is the 
hyperplane P a Z a = 0. To be more precise, we take P a as describing the probability state for 
a set of events. Then, all the probability states corresponding to the complementary events 
lie in the prime P a Z a = 0. Thus, the points Z a on this plane are precisely the states that 
are orthogonal to the original state P a . The intuition behind this is as follows. Two states 
P a and Q a are orthogonal if and only if any event which in the state P a (resp. Q a ) has a 
positive probability is assigned zero probability by the state Q a (resp. P a ). This is the sense 
in which orthogonal states are 'complementary'. For any state P a , the plane consisting of 
all points Z a such that P a Z a = is the set of states complementary to P a in this sense. 

Distinct states X a and Y a are joined by a real projective line represented by the skew 
tensor L ab = X^- a Y b \ The points on this line are the various real superpositions of the original 
two states. The intersection of the line L ab with the plane R a is given by S a = L ab R b . Clearly 
S a lies on the plane R a , since R a S a = on account of the antisymmetry of L ab . 

The hyperplanes that are the negations (polar planes) of two points P a and Q a intersect 
the joining line L ab = P^ a Q b ^ at a pair of points P a and Q a respectively. That is, if P a = g a bP b 
is the coordinate of a plane and L ab represents a line in RP n , then the point of intersection 
is given by P a = L ab P b , and similarly Q a = L ab Q b . The projective cross ratio between these 
four points P a , Q a , P a = P a (Q b P b ) - Q a (P b P b ) and Q a = P a (Q b Q b ) - Q a (P b Q b ), given by 
P a QaQ b Pb/P c P c Q d Qd, reduces, after some algebra, to the following simple expression: 

K = p^q4 ' (30) 

which can be interpreted as the transition probability between P a and Q a . It is interesting 
to note that this formula has an analogue in quantum mechanics 0] . 

The projective line L ab can also be viewed as a circle, with P a and Q a antipodal to the 
points P a and Q a . In that case, the cross ratio k is |(1 + cos#) = cos 2 (6 l /2) where 9 defines 
the angular distance between P a and Q a , in the geometry of RP n . We note that 9 is, in 
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fact, twice the angle made between the corresponding Hilbert space vectors, so orthogonal 
states are maximally distant from one another. 

Now suppose we let the two states P a and Q a approach one another. In the limit the 
resulting formula for their second-order infinitesimal separation determines the natural line 
element on real projective space. This can be obtained by setting P a = Z a and Q a = Z a +dZ a 
in (0), while replacing 6 with the small angle ds in the expression |(1 + cos#), retaining 
terms of the second order in ds. Explicitly, we obtain 



ds 2 



dZ a dZ a {Z a dZ a ) 

z b z b (z b z b y 



(31) 



Note that this metric [20] is related to the metric on the sphere S n in 7i n+1 , except that in 
the case of the sphere one does not identify opposite points. 

We now consider the case where the real projective space is the state space of classical 
statistical mechanics. If we write ip a {f3) for the trajectory of thermal state vectors, as 
discussed in Section 2, then we can regard ip a (P), for each value of /3, as representing 
homogeneous coordinates for points in the state space RP n . Since i/j a (/3) satisfies @ it 
follows that the line element along the curve ip a {f3) is given by ds 2 = (H 2 )d(3 2 , which can 
be identified with the Fisher-Rao metric induced on the thermal trajectory by virtue of its 
embedding in RP n . This follows by insertion of the thermal equation (|3|) into expression 



(|31"|) for the natural spherical metric on RP r 



B. The projective thermal equations 

Let us write H a b for the symmetric Hamiltonian operator, and ip a {(3) for the one- 
parameter family of thermal states. Then in our notation the thermal equation is dip b = 
— \H b ip c df3. However, we are concerned with this equation only inasmuch as it supplies 
information about the evolution of the state of the system, i.e., its motion in RP n . We are 
interested therefore primarily in the projective thermal equation, given by 

ipl a dip b] = ~?p [a H b] ilj c df3 , (32) 

obtained by skew-symmetrising the thermal equation (Q) with ip a . Equation ( 132] ) defines the 
equilibrium thermal trajectory of a statistical mechanical system in the proper state space. 

The thermal equation generates a Hamiltonian gradient flow on the state manifold. This 
can be seen as follows. First, recall that a physical observable F is associated with a 
symmetric operator F abl and the set of such observables form a vector space of dimension 
|(n + l)(n + 2). Such observables are determined by their diagonal matrix elements, which 
are real valued functions on RP n of the form F(ip a ) = F a t,ip a il) b /if) c ip c . In particular, we are 
interested in the Hamiltonian function H(ip a ). Then, by a direct substitution, we find that 
the vector field H a = g ab dH/dtp b associated with the Hamiltonian function H takes the 
form H a = 2H^i/j b /i/j c i/j c . Therefore, we can write the differential equation for the thermal 
state trajectory in 7i n+1 in the form 



dp 4 



-g ab V b H . (33) 
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By projecting this down to RP n , we obtain 




(34) 



where V a H = g r V&i? . From this we can then calculate the line element to obtain 



which establishes the result we noted earlier. 

The critical points of the Hamiltonian function are the fixed points in the state space 
associated with the gradient vector field g ab Vf,i/. In the case of Hamilton's equations the 
fixed points are called stationary states. In the Hilbert space 7i n+l these are the points 
corresponding to the energy eigenstates u% given by H% u b k = E k u% (in the general situation 
with distinct energy eigenvalues E k , k = 0,1, • • • ,n). Therefore, in the projective space 
RP n we have a set of fixed points corresponding to the states u k , and the thermal states 
are obtained by superposing these points with an appropriate set of coefficients given by the 
Boltzmann weights. Since these coefficients are nonzero at finite temperature, it should be 
clear that the thermal trajectories do not intersect any of these fixed points. 

In particular, the infinite temperature (f3 = 0) thermal state tp a (0) is located at the 
centre point of S + in H n+1 . The distances from ip a (0) to the various energy eigenstates 
are all equal. This implies that in RP n the cross ratios between the fixed points and the 
state ifj a (0) are equal. Therefore, we can single out the point ifj a (0) as an initial point, and 
form a geodesic hypersphere in RP n . All the fixed points of the Hamiltonian vector field 
H a lie on this sphere. Since the cross ratios between the fixed points are also equal (i.e., 
maximal), these fixed points form a regular simplex on the sphere. The thermal trajectory 
thus commences at Z a (0), and asymptotically approaches a fixed point associated with the 
lowest energy eigenvalue Eq, as (3 — > oo. 

If we take the orthogonal prime of the thermal state for any finite (3, i.e., -iip a ((3), then 
the resulting hyperplane clearly does not contain any of the fixed points. On the other 
hand, if we take the orthogonal prime of any one of the fixed points u k , then the resulting 
hyperplane includes a sphere of codimension one where all the other fixed points lie. This 
sphere is given by the intersection of the original hypersphere surrounding ip a (0) with the 
orthogonal prime of the given excluded fixed point. There is a unique prime containing n 
general points in RP n . It is worth noting, therefore, that if we choose n general points given 
by Uj (j — 0, 1, • • • ,n;j ^ k), then there is a unique solution, up to proportionality, of the 
n linear equations X a u^ = 0, • • •, X a u® = 0. The solution is then given by X a = u k , where 
u k = el c ,,. d u b Qu\ • ■ ■ u^. Here, e a b -c = C[ a b-c], with n + 1 indices, is the totally skew tensor 
determined up to proportionality. 

It would be interesting to explore whether the orientability characteristics of RP n lead to 
any physical consequences. There may be a kind of purely classical 'spin-statistics' relation 
in the sense that the state space of half-integral spins are associated with a topological 
invariant, while the state space for even spins are not. 



ds 2 = 8V># 6 V[a# b] /(^c) 2 

= l -^ a V b] H4, [a V b] Hd{3 2 

= ^7 b HV b H^adp 2 

= {H 2 )d(3 2 , 



(35) 
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C. Hamiltonian flows and projective transformations 



We have seen in Section 3.B how the thermal trajectory of a statistical mechanical system 
is generated by the gradient flow associated with the Hamiltonian function. In other words, 
for each point on the state manifold we form the expectation of the Hamiltonian in that 
state. This gives us a global function on the state manifold, which we call the Hamiltonian 
function. Next, we take the gradient of this function, and raise the index by use of the 
natural metric to obtain a vector field. This vector field is the generator of the thermal 
trajectories. 

It is interesting to note that there is a relation between the geometry of such vector 
fields and the global symmetries of the state manifold. In particular, we shall show below 
that gradient vector fields generated by observables on the state manifold can also be in- 
terpreted as the generators of projective transformations. A projective transformation on a 
Riemannian manifold is an automorphism that maps geodesies onto geodesies. In the case 
of a real projective space endowed with the natural metric, the general such automorphism 
is generated, as we shall demonstrate, by a vector field that is expressible in the form of a 
sum of a Killing vector and a gradient flow associated with an observable function. 

To pursue this point further we develop some differential geometric aspects of the state 
manifold. We consider RP n now to be a differential manifold endowed with the natural 
spherical metric g ab . Here, bold upright indices signify local tensorial operations in the 
tangent space of this manifold. Thus we write V a for the covariant derivative associated 
with g ab , and for an arbitrary vector field V a we define the Riemann tensor -R abc d according 
to the convention 

V [a V b] V c = l -R ahA c V A . (36) 

It follows that -R a bcd := R ahc e gde satisfies -R abc d = #[ab][cd], -Rabcd = -Rcdab, and i?[ abc ]d = 0. 
In the case of RP" with the natural metric 

ds2 = szh&zgz. 



X(Z C Z, 



rl 2 



where Z a are homogeneous coordinates and A is a scale factor (set to unity in the preceding 
analysis), the Riemann tensor is given by 

-Rabcd = A(g ac g b d — fi'bcfi'ad) • (38) 

Now we turn to consider projective transformations on RP n . First we make a few general 



remarks about projective transformations on Riemannian manifolds [^1]]. Suppose we have 
a Riemannian manifold with metric g a b and we consider the effect of dragging the metric 
along the integral curves of a vector field £ a . For an infinitesimal transformation we have 

#ab -> #ab = #ab + C^ab ' ( 39 ) 

where C^g a b = 2V( a £ b ) is the Lie derivative of the metric (e << 1). The Levi-Civita 
connection V a associated with g ab is then defined by a tensor Q ab , symmetric on its lower 
indices, such that 
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V a \/ C = V a ^ c + eQ* h V b , (40) 

for an arbitrary vector field V a , and V a g bc = 0. A familiar line of argument based on the 
fundamental theorem of Riemannian geometry then shows that 

Qab = ^ (Va^bd + V b ^ ad + V d £ a b) , (41) 

where g ah is the inverse of g ab , given to first order in e here by g ah = g ah — 2eV (a £ b ). A 
short calculation making use of (^) then shows that to first order we have: 

Qab = V (a V b) r + R\ ah)d Z d . (42) 

Now, suppose the vector field U a satisfies the geodesic equation, which we write in the 
form (?7 a V a £/' b )£/ c ' = 0. It should be apparent that U a is geodesic with respect to the 
transformed metric g ab iff (U a V a U^- h )U c ^ = 0, or equivalently, U a U h Qj^U d ^ = 0. However, 
this relation is satisfied for all geodesic vector fields iff there exists a vector field a such that 
QaL = ^(a^b)- Hence we conclude that the vector field £ a is the generator of a projective 
transformation, mapping geodesies to geodesies, iff there exists a vector field 4> a such that 

V ( aV b )f + J R c (ab)d e d = 5 c { J b ) (43) 

On the other hand, it follows as a direct consequence of fl3~6f ) and the identity -R[ abc ]d = 
that for any vector field £ a we have 

V [a V b] r-i? c [ab] d£ d = 0. (44) 
Hence, combining ( |4*3"D and (53) we obtain 

VaVbF + R c had H d = 6 c ( J h) , (45) 



as a condition equivalent to ( f4"3"D for the existence of a projective transformation on the given 
manifold. This form of the condition is particularly useful for calculations (see, e.g., ||21|| ). 

In particular, if £ a is a Killing vector, so V( a £ b ) = 0, then Q4"5j ) is automatically satisfied, 
with a = since as a consequence of (|36|) a Killing vector necessarily satisfies V a V b £ c + 
i? c bad £ c = 0. Thus, a Killing symmetry generates a projective transformation. For example, 
in the case of RP n we have the symmetries of the projective orthogonal group. 

Now we derive an integrability condition for ([45]) that will be helpful in our analysis of the 
remaining projective transformations on RP n , apart from those associated with Killing sym- 
metries. Returning to the transformation ([39|), we consider the curvature -R a bc d associated 
with the metric ^ a b- Clearly, to first order in e this is given by 

^abc d = ^abc d + 6 A jR abc d ■ (46) 



On the other hand, it is well known (see, e.g., [J22| ) that for a change of connection given by 
( fl"0|) the corresponding change in the curvature is 

2^abc d = 2 R a h c A + eV [a<5 b d c + e2 Q r f a Qb]c ■ (47) 
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Thus by consideration of terms to first order in e we deduce that 



^R abc d = V [a Q« c , (48) 

with Q ah given as in equation fl42|) . This relation holds for any infinitesimal transformation of 
the form ( |59"[ ) generated by a given vector field £ a . In the case of a projective transformation, 
for which Q ab = 5(a0b), it follows, therefore, after some elementary rearrangement, that 

A^ab/ + <V b] c - V [a b] 5 c d = . (49) 

This is the desired integrability condition. In the case of a space of constant curvature we 
have (^) and hence 

C^R ahc A = 2AV (a £o<g - 2AV(b£c)tf • (50) 



Inserting this relation into equation (|49[ ) we obtain A ac g hd — A hc g ad = ^7[ a 4>b]9cd, where 
A a b = 2AV( a £t>) _ V a 0b- This implies A ab = and V[ a £b] = 0. For otherwise, we could 
contract A ab with a vector B c to obtain C a g bd — C b g a d = V[ a 0b]-E>d where C a = A ac B c . 
But this would imply that g a b is at most of rank two. It follows therefore that <p a is the 
gradient of a scalar, and that £ a is necessarily of the form 

T = v* + g &h v b H , (51) 

for some function H(x) on the state manifold, where r] a is a Killing vector, and a = 2XV a H. 
Returning this information to equation (|43[) , we conclude that 

V (a V b V c )i/ = 2\g {ah V c) H . (52) 

In other words, K a \j := V a Vb-^ — ^.Xg^H is a Killing tensor: V( a -ft'bc) = 0. However, this 
is the defining equation on RP n for observable functions, i.e., functions of the form 

- ^ • (53) 

in terms of homogeneous coordinates on RP n . Thus, we conclude that the generator of a 
projective transformation on the state space is given by the sum of a Killing vector and the 
gradient of an observable function. 

We remark, incidentally, that if the dimension n of the manifold is finite, then we can 
form the trace of (|45|) , which gives V a (V 2 if) = A(n + l)V a -H", from which we conclude 
that up to an additive constant, H(x) is an eigenfunction of the Laplacian operator, i.e., 
V 2 H = X(n + 1)H. The only global solutions of this equation on RP n are necessarily of 
the form fl53|). Such functions also satisfy fl5"2"|). In the case of infinite dimension the trace 
is not necessarily a valid operation, and we lose the representation of observable functions 
as eigenfunctions of the Laplacian. On the other hand, our characterisation of projective 
transformations as vector fields of the form (|5l"D , given by the sum of a Killing vector and 
the gradient of a scalar H(x) satisfying (^) and (|55D, is entirely general, valid in both 
finite and infinite dimensions. Setting A = 1, we see therefore that thermal states are 
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characterised by vector fields on RP n of the form £ a = g ab Vb-£^, where H(x) satisfies 
V (a V b V c) # = 2g (ah V c) H. 

It is worthwhile noting that among the trajectories generated by the Hamiltonian gra- 
dient flow, the thermal trajectory is singled out on account of the special initial state cor- 
responding to /3 — 0. This state is, of course, completely determined by the specification of 
the Hamiltonian function. In particular, for a given Hamiltonian H a b there are in general 
n + 1 fixed points in the state manifold. The initial state for the thermal trajectory is then 
uniquely determined by the condition that it is equidistant from these n + 1 fixed points. 



IV. QUANTUM STATISTICAL MECHANICS 

A. The quantum phase space 

In the algebraic approach to quantum theory one generally works with the totality of 
mixed states, while regarding pure states as extremal elements of this convex set [pL~0| | , and 



in the Heisenberg picture for dynamics. The approach we adopt here, however, will focus 
primarily on the system of pure states, and for dynamics we shall work in the Schrodinger 
picture. This is consistent with the point of view we put forward earlier in Section 2 in 
connection with the discussion of classical thermal states. 

Given a complex Hilbert space with complex coordinates Z a representing pure 

quantum states, we identify the state vector Z a with its complex multiples XZ a , A G C — {0}, 
to obtain the complex projective space CP n . Here we use Greek indices (a = 0, 1, • • • , n) for 
vectors in 7i^ +1 - Following the line of argument indicated earlier in the case of the thermal 
state space, we regard CP n as the true 'state space' of quantum mechanics. The status 
of density matrices will be discussed shortly. Our goal now is to build up the necessary 
projective geometric machinery appropriate for the consideration of quantum mechanics. 

Suppose we regard the state vector Z a as representing homogeneous coordinates on the 
projective Hilbert space. The complex conjugation of the state vector Z a is written Z a , with 
the corresponding Hermitian inner product Z a Z a . The complex conjugate of a point P a in 
CP n is the hyperplane (prime) P a Z a = 0. The points on this plane are the states that are 
orthogonal to the original state P a , and we denote this -iP a in the sense of the probability 
rules discussed above. Thus, CP n is equipped with a Hermitian correlation, i.e., a complex 
conjugation operation that maps points to hyperplanes of codimension one, that is, CP 71 ^ 1 . 

Distinct points X a and Y a are joined by a complex projective line L a/3 = X^Y®, 
representing the various complex, quantum mechanical superpositions of the original two 
states. The quantum mechanical transition probability between two states X a and Y a is 
then given by the cross ratio 

= 

X^X^Y S Y S 1 ; 

More precisely, we recall that if the system is in the state Y a and a measurement is made 
to see if the system is in the state X a (corresponding to a measurement of the observable 
represented by the projection operator X a Xg/X 7 X 7 ), then k is the probability that the 
result of the measurement is affirmative. If we set k = cos 2 {6/ '2) and set X a = Z a and 
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ya — + dZ a , retaining terms to second order, we recover the natural unitary-invariant 
metric on complex projective space, known as the Fubini-Study metric 1 2011 , given by 



ds 2 = 8(Z a Z a )- 2 Z [a dZ (3] Z [a dZ f3] . (55) 

Now suppose we write Hp for the Hamiltonian operator, assumed Hermitian. Then for 
the Schrodinger equation we have 

IF = ■ < 56 > 

However, we are more interested in the projective Schrodinger equation, given by 

Z [a dZ® = iZ [a H®Z~<dt , (57) 

which eliminates the superfluous degree of freedom associated with the direction of Z a . This 
equation is well defined on the projective state space. Insertion of ( |57D into (|55]) gives 

ds 2 = 4(H 2 )dt 2 , (58) 

which verifies that the velocity of a state along its trajectory in state space is ds/dt = 2 AH, 
where AH = (H 2 ) 1 / 2 is the energy uncertainty |23| . 

An alternative way of looking at this structure is to regard the state manifold CP n 
as a real manifold Ai 2n of dimension 2n, equipped with a Riemannian metric g a b and a 
compatible complex structure f2 a t>. Here again we use bold indices for tensorial operations 
in the tangent space of Ai 2n . In this formulation the Schrodinger equation takes the form 
of a flow £ a on M. 2n given by 

£ a = n ab V h H . (59) 

Here H is the real function on Ai 2n given at each point by the expectation of the Hamiltonian 
operator at that state. It is interesting to note that in the quantum mechanical case the 
dynamical trajectory is given by a Hamiltonian symplectic flow in contrast to the case of 
statistical mechanics where the relevant flow is given by a Hamiltonian gradient flow for 
the thermal trajectories. Furthermore, we note that in the case of quantum mechanics the 
flow is necessarily a Killing field, satisfying V( a £t>) = 0, where £ a = g a b£ b - In other words, 
the isometries of the Fubini-Study metric on CP n can be lifted to 7^q +1 to yield unitary 
transformations. In the statistical mechanical case, on the other hand, the general projective 
transformation of the manifold includes both Killing vectors and Hamiltonian gradient flows, 
but we exclude the former. In both quantum mechanics and statistical mechanics the fixed 
points of the Hamiltonian operator play a pivotal role. 



B. The quantum thermal states 

In Section 4. A above we have indicated two useful ways of looking at the quantum state 
space. The approach via complex homogeneous coordinates is appropriate when the global 
algebraic geometry of the state manifold is of interest. The differential geometric approach 



is useful when local properties of the state manifold are considered |L8|] . There is a third, 
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equally important approach, however, which we shall call the 'real' approach gJgj2J] to 
quantum mechanics. In this case we regard the Hilbert space of quantum mechanics as 
a real vector space of dimension 2n + 2. The importance of the real approach is that it 
links up directly with modern notions from probability theory and statistics. It should 
not therefore be surprising that this approach is also useful in considering the quantum 
dynamics of thermal states, and indeed in considering the relation between classical and 
quantum statistical mechanics. 

Let us write £ a (a = 1, 2, ■ • • , 2n + 2) for the real Hilbert space coordinates corresponding 
to a quantum mechanical state vector. The other ingredients we have at our disposal are the 
metric g a b on 'H 2n + 2 and a compatible complex structure tensor J h a . Therefore, we proceed 
in two stages. First we regard the state vector £ a as homogeneous coordinates for a real 
projective space RP 2n+1 of one dimension lower. This, of course, corresponds to factoring 
out by the freedom £ a — > A£ a with A real and nonvanishing. Then there is another more 
subtle freedom, corresponding to factoring out by the 'phase' freedom. This is given by the 
transformation 

r - \e*{? + iW) + \e-^(C - iJ a b e) , (60) 

where J% is the complex structure, and is a phase factor. In this way we obtain a map 
RP 2n+1 — > CP n , which is related to the Hopf map S 2n+1 — > CP n , and indeed we have the 
diagram 

\ I \ (61) 

R p2n+l > C pn 

showing the relation between these various spaces. The point of this line of reasoning is that 
if we take the classical statistical argument based on the state space RP 2n+1 , and introduce a 
complex structure on the real Hilbert space 7i 2n+2 , then the analogous relations for quantum 
statistical mechanics, and moreover, quantum dynamics of the equilibrium thermal states 
can be developed by studying the relation between RP 2n+l and CP n . 

Now we consider more explicitly the characterisation of thermal states in the context 
of quantum 'thermodynamics'. There are a number of features of the Hilbert space based 
description of conventional statistical mechanics that carry through to the quantum regime, 
though there are some new features as well. In particular, if we start with the real Hilbert 
space 7Y 2n+2 and the associated state space RP 2n+1 obtained when we neglect the complex 
structure on ~}{ 2n+2 given by quantum theory, then we can apply the theory developed earlier 
to define thermal trajectories in the space RP 2n+1 . The main difference arising between the 
classical and quantum cases is that in the quantum theory the Hamiltonian operator H a b 
satisfies the Hermitian property 

H ah J^J h d = H cd , (62) 

which ensures that it has at most n + 1 distinct eigenvalues, unlike the classical situation 
where there could be 2n + 2. The thermal equation giving the development of i/j a (f3) as the 
inverse temperature (3 is changed, is again given by 
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(63) 



where H ab = H ab - g ab H cd ip c ijj d / 'g e f^ e ^ J '■ 

Another feature that distinguishes the quantum theory is that in general there is a 
multiplicity of infinite temperature (/3 = 0) states, corresponding to different values of the 
'phases' associated with the state. Topologically, this implies that there is a torus T n = (S 1 ) 11 
in CP n , the points of which correspond to the /3 — state. Thus, for a specified infinite 
temperature state, i.e., for a given point on T n , we shall call the solution of (|63"D a 'primitive' 
thermal state parameterised by a set of phases. In other words, there is a multiplicity of 
primitive thermal trajectories, each corresponding to a choice of infinite temperature state. 
In the case of a two state system (e.g., a spin one-half particle) the two energy eigenstates 
correspond to the north and south poles of the state space CP 1 ~ S 2 , and the infinite 
temperature states are given by points on the equator taking different values of the phases. 
The thermal trajectories are then given by the geodesic segments that join the equator to 
one of the poles. This example is studied in more detail in Section 4.D. 

For many applications we have to consider ensembles of particles for which the starting 
point of the thermal trajectory is effectively random. In other words, although each particle 
in the ensemble has a definite thermal trajectory, when we form expectations based on the 
behaviour of the ensemble we have to average over the manifold of infinite temperature 
states, i.e., average over the random phase factors. This leads to the well known density 
matrix based description of thermal states, as we discuss below. Thus once again the density 
matrix formulation emerges as an essentially secondary construction, although of course it 
remains essential for many practical applications. 

Another characteristic of quantum thermal dynamics is that the thermal trajectories 
have a time evolution, given by the Schrodinger equation. Written in terms of real state 
vectors this evolution is given by 



for each thermal trajectory, where the variables collectively denoted <p label the /3 — states. 
Thus the primitive thermal states are pure states, subject to the usual evolutionary laws. 

In our earlier discussion of classical thermal systems, we introduced the idea of a random 
state. This notion is also useful in a quantum mechanical context. For example, a random 
state can be used to represent the outcome of a measurement process, or to describe the 
statistics of an ensemble. In particular, as we indicated earlier, in the case of a quantum 
thermal state we want to take an ensemble average over all possible infinite temperature 
states to describe the observed features of ensembles. This point will now be explored 
further. 

Suppose we denote by <fr the system of random phases labelling the (3 = states for a 
given quantum ensemble. The thermal state tp% is thus itself to be viewed as a random state, 
since <fr is random. Then if X ab is a typical observable, satisfying the Hermitian property, 
we can write -E#[A] = X ab ip^ip^ for the conditional expectation of X ab , given <fr. This is 
in keeping with the line of argument introduced earlier, for classical thermal systems. The 
ensemble average E[X] is then given by the unconditional expectation 




(64) 



E[E*[X]\ 



X ab p' 



ab 



(65) 
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where 



p a \p) = E[r^i] (66) 

is the Hermitian density matrix characterising the state of the ensemble. Here the un- 
conditional expectation E[—] averages over the phase variables appearing in random state 
■0^(0, 0). In this connection it is worth noting that the infinite temperature states have the 
property that the ensemble average in formula fl6"6|) is proportional to the metric for (3 = 0, 
that is, p ab (0) = 2^+29 ab ' where 2n + 2 is the dimension of the real Hilbert space. 

It is interesting to take note of the topological characteristics of the infinite temperature 
state manifold. As we already remarked, this manifold can be viewed as an n-torus T n 
sitting in the state manifold CP n . This can be seen as follows. First, recall the fact 
that the (3 = states are equidistant from the fixed points u% of the Hamiltonian flow 
in CP n , and that these fixed points form a regular simplex. Thus, we would like to find 
the locus of points Z a such that the distances between Z a and u% are equal for all k. 
Since the points uf. are vertices of a regular simplex, we can choose coordinates, without 
loss of generality, such that Uq = (1,0, (),•••), = (0, 1,0, •••), and so on. Thus, by 
writing the equidistance condition in terms of the cross ratio explicitly, we find that the 
locus of Z a = (Z°, Z l , ■ ■ ■ , Z n ) is given by the simultaneous solution of the following system 
of equations: Z°Z = Z X Z\,Z X Z\ = Z 2 Zi,- • • ,Z n Z n = Z°Z . We thus deduce that the 
solution takes the form 

Z a = (e^°,e^V"-,e* 1 ) , (67) 

for some phase variables <j)j. This implies that the topology of the infinite temperature 
states manifold indeed has a structure of an n-torus T n = (S* 1 )™ (one of the n + 1 phase 
variables <pj is redundant, and can be scaled away), as claimed earlier. In summary, the 
fundamental torus T n C CP n associated with a given Hamiltonian function consists of all 
those states that are equally distant to the eigenstates of the specified Hamiltonian with 
respect to the Fubini-Study metric on CP n . The significance of T n is that it embodies a 
geometrical representation of the concept of the extraneous phases over which one has to 
integrate for many purposes in thermal physics. 



C. The equilibrium condition 

The property that the density matrix p ab (0) at infinite temperature is proportional to the 



metric is, in fact, closely related to the KMS-condition fll0| , |25|l , which provides an alternative 
characterisation of thermal equilibrium states. It is therefore interesting to see how the KMS 
construction fits into the present description of thermal phenomena, and here we shall briefly 
develop some of the relevant ideas. First, recall that a general state, or density matrix, can be 
regarded as a semidefinite map p(A) from Hermitian operators A to the real numbers given 
by p(A) = Tr(pA), satisfying p(XX) > for all X, and p(g) = 1, where g is the identity 
operator. Now the trace operation has the property Tr(AB) = Tt(BA). For a general 
state p, on the other hand, clearly we do not have p{AB) = p(BA). However, the thermal 
equilibrium states pp{—) are characterised by a slightly weakened form of commutation 
relation, which is the KMS condition. 
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Suppose for a Hermitian operator A we write r t A for the unitary action of the one- 
parameter group of time translations given by r t A = e itH Ae- itH . Then, we can extend this 
definition to complex time by writing T t+i pA = e l ( t + l i 3 ) H y \ e - t ( t + l f 3 ) H _ The KMS condition on 
a state p is that for all Hermitian operators A and B it should satisfy 

p(r t AB) = p{Br t+i( ,A) . (68) 

It is not difficult to see that in finite dimensions this implies that p is the thermal state pp 
discussed above, and that commutivity holds for infinite temperature. 

Now let us look at these relations from a real point of view. First we need to consider 
the action of the one-parameter group of time translations. Suppose we have a real Hilbert 
space of dimension 2n + 2, with typical elements £ a , vf G 7i. On 7i we also have the metric 
g a b and the complex structure J b , satisfying J^J b = — 5 b and J^J^gcd = 9 at- The tensor 
Q a b = QacJb is thus antisymmetric, and defines a symplectic structure on Tt. With these 
ingredients at hand, we can define the action of the orthogonal and unitary groups on Ti. 

For a typical element £ a G Ji the orthogonal group 0(2n + 2) consists of transformations 
£ a — > O b 2 t; b , satisfying O^O^g c d = g^. The unitary group U(n + 1) can then be regarded as 
a subgroup of 0(2n + 2), given by transformations £ a — > t/^£ b satisfying 

u c a u$gai = g ab , u c a ufa cd = n ab . (69) 

In other words, transformations that preserve both the metric and the symplectic structure 
on 7i. For typical real elements £ a , if G Ti their ordinary real Hilbert space inner product 
is, of course, given by f] a g a b^, b , which is invariant if £ a and rf are subject to orthogonal 
transformations. On the other hand, the standard Dirac product (77 1£) between two real 
vectors £ a and rf ', given by 

(V\0 = ^(g^-iQ^e , (70) 

is invariant only under unitary transformations. This should be clear from the need to 
preserve the real and imaginary part of ( [70]) separately, which implies (|69|). 



If 0\ is an orthogonal transformation, then the corresponding inverse element O b a is 
given by the transpose 0\ = g hc g at }P d c , which satisfies O^O b = 5%. In the case of a unitary 
transformation we can, without ambiguity, use the notation JJ\ = g bc g a dU d for the conjugate 
transformation, from which it follows that 

U£U b c = 6 a c , U^J b U c d = 5 a d (71) 

are equivalent to ( |69j) in characterising unitary transformations. In particular, (|7T|) can be 
viewed as saying that the unitary transformations on 7i are orthogonal transformations that 
also preserve the complex structure. 

In equation ( ffl|) we see the action of the unitary group on the complex structure tensor. 
More generally, for a typical multi-index tensor A^ c we define the action of the unitary group 
by Al c — ► U^A^, c ,U b U° . Here the primed indices merely serve to increase the size of the 
standard alphabet. 

Now suppose we consider one-parameter subgroups of orthogonal group, continuous with 
the identity. Such transformations are of the form O b = exp [tM ac g bc ] where the tensor 
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M a b is antisymmetric. For a one-parameter family of unitary transformations we have to 
specialise further, and require M a b to be of the form M a b = ^acH^, where H a b is symmetric 
and Hermitian. Indeed, if H ab is symmetric, then a necessary and sufficient condition that 
it should be Hermitian is that fl ac H b is antisymmetric. It follows that the general one- 
parameter group of unitary transformations, continuous with the identity, can be written 

r • (72) 

If H a b is the Hamiltonian, then is the Schrodinger evolution generated from the given 
initial state. The point of view here continues to be purely 'real' in the sense that the complex 
structure tensor is playing the role of the factor of 'i' in the conventional expression of 
unitary evolution given in the Dirac notation by |£) — > e ltH \£). Our goal is to formulate a 
similar geometrisation for the KMS condition. 

For this purpose we need to consider the changes implied for the picture noted above 
when we go to the Heisenberg representation. Suppose A a b is an observable (symmetric, 
Hermitian operator) and we consider the evolution of its expectation A a b^ a ^ b under the 
action of the unitary transformation £ a — > U^ b with = exp[iJ"iJ£]. The Heisenberg 
picture is obtained if we let £ a be stationary, and let A ab evolve according to the scheme 

A ab - U c a U*A cd . (73) 

It will be appreciated that the evolution of A ab in the Heisenberg representation is contra- 
gradient to the 'natural' action of the unitary group A a b — > U% IJ\ A a iy discussed earlier, 
since the natural action of the unitary group is defined to be the action on A ab that pre- 
serves A a b^ a ^ b when £ a is evolved in the Schrodinger representation. By the same token, in 
the Heisenberg representation the action of the time evolution operator on an observable, 
represented in 'operator' form A b a (rather than A a b) is given by 

A b a -> U:'A b ;,U b , (74) 

where XJ a a ' = exp[tJ^' H b ] and U a a ' = exp[-t J? H b \. 

Now, finally, we are in a position to address the KMS condition. First we note that for 
a pair of observables A ab and B a b, their quantum mechanical 'Dirac' product C = AB is 
given by 

Ca h = A c{a A cd B h)d , (75) 

where A ab = \(g ab — iVL ab ). Clearly, C ab is a complex tensor: its real and imaginary parts are 
given respectively by the Jordan product \{AB + BA) and the commutator \i{AB — BA). 
In particular, we have 

Cab = \A\ a Bf )9cd - UAlX^a ■ (76) 

It follows that if p a b is a density matrix, the expectation p(AB) in the corresponding 
state is given by p(AB) = p^A b c A d B d , or equivalently 

p{AB) = \ Pab A a c B bc - l -i Pab A a c B b d W d . (77) 



C - a = exp 
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As we noted earlier, the action of the one-parameter group r t on A is given by 



A. 



r t A b a 



exp 



At 



exp 



-tJ h ,H n 



(78) 



The KMS complexification of this action, corresponding to replacing t with t + i(3, replaces 



t J b a with t J b a 



(38 a , and we have 



exp 



u^i - mi 



K> exp 



-tJ b ,H d + (3H b 



It is then straightforward to verify that in the case of a thermal state, for which 



(79) 



(80) 



the KMS condition pp{j t AB) = p/3(BT t+i/3 A) is indeed satisfied. 

It might be argued that the complexification t — > t + i(3 has an artificial character when 
seen from a real point of view, since it involves an operator transformation of the form 
t8 a — > t8 a + (3J b . Moreover, equation ( ff9|) takes one outside the category of symmetric 
tensors. Since we would like to argue that the 'real' approach to quantum theory acts as the 
natural bridge between quantum dynamics, on the one hand, and modern statistical theory 
and hence thermal physics, on the other hand, we are thus led to the negative conclusion that 
the KMS condition, despite its historical significance in the development of modern thermal 
physics, should not be regarded as fundamental. This is in keeping with our emphasis on 
so-called primitive thermal states, which play a significant role even before the consideration 
of ensemble behaviour. 



D. Phase space dynamics and temperature 

In the foregoing formulation, we have adopted the view point that, given a system in 
heat bath with inverse temperature (3, we let the equilibrium thermal states evolve quantum 
mechanically under the influence of the Hamiltonian. We have also considered the case of 
an ensemble of particles in a thermalised box, whereby while each particle takes a definite 
energy value, the probability law of the energy for a random element of the ensemble being 
given by the Boltzmann distribution. In either case, the value of (3 is taken as an 'input' 
variable that specifies the thermal state. That is to say, we have adopted the standard 
canonical ensemble of distributions for which temperature is operationally defined. 

This point of view can, to some extent, be inverted by shifting gears from the standard 
canonical description to the microcanonical ensemble. In this case, the essential equivalence 
of the two approaches is typically recognised only for large systems (see, e.g., [26]). The 



dynamical formulation we provide here will be useful, in particular, for small systems such 
as those on a quantum or mesoscopic scale. 

First, we regard as given a Hamiltonian function H defined on the quantum mechanical 
state space CP n , which is viewed as a real manifold of dimension In. We then foliate the 
real state manifold with constant energy surfaces, given by H(x) = E. The volume V(E) of 
such a surface £f n_1 , given by 
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v < E > - 4- * « • (81) 

then tells us the number of microscopic states having the energy E. Here, we have written 
da a = g ab ebc-ddx c ■ ■ -dx d for the natural vector-valued (2n — l)-form on CP n (viewed as a 
real manifold). Therefore, using the Boltzmann relation, the entropy S(E) associated with 
the energy surface Se is given by S(E) = \nV(E). As a result, the inverse temperature can 
be calculated from the usual prescription 

dS(E) , , 

In this way, we can calculate the temperature of the system directly from the given quantum 
mechanical dynamics of the system, or equivalently, given the inter-relationship of the energy 
surfaces. 

The construction just noted for obtaining temperature from the underlying phase space 
dynamics is well known in classical statistical mechanics (see, e.g., ||27|| ). The novelty here, 



however, is to regard the quantum mechanical state space CP n as a real 2n-dimensional 
manifold playing the role of the quantum mechanical phase space, hence allowing us to 
obtain the temperature for quantum mechanical dynamical systems in the case of the mi- 
crocanonical ensemble. Based upon this formulation, we can also investigate the possibility 
of various generalisations of the temperature, when the underlying quantum mechanical dy- 
namics is modified. There are a number of distinct generalisations that can be pursued in 
this context. 

First, we can replace the Hamiltonian function defined on the state manifold by a more 
general observable. In this case, the underlying theory would correspond to the Kibble-type 
theories of nonlinear quantum mechanics, for which the above prescription would provide 
the temperature in the case of a nonlinear dynamical system of this sort. Secondly, we 
can consider replacing the Fubini-Study metric on the state space by a more general metric. 
Alternatively, a more radical extension is obtained by replacing the state space manifold itself 
with a general Kahler manifold. In such generalisations the notion of particle states may no 
longer survive. On the face of it, this might appear to be an undesirable feature. However, 
our view is that at high energies, as in the instance of particle collisions, the notion of particle 
states as such may be lost in any case, and recovered only at an asymptotic level, i.e., on the 
tangent space. For such generalisations the notion of temperature we have outlined above 
would nonetheless survive, indicating a strong interlink between the geometrical structure 
of the state manifold and the thermodynamics of associated statistical ensembles. 



E. Spin one-half particle 

In this section, we study the thermal dynamics of systems having two energy levels. 
For such systems, it is easy to see that most physical quantities of interest depend only on 
the energy difference but not on the actual values of the energies. Therefore, for practical 
purposes any two level system can be viewed as essentially equivalent to a system consisting 
of a spin one-half particle interacting with an external field. 

We study the classical situation first. In this case, we have an Ising spin in a constant 
magnetic field whose strength is h. The associated real Hilbert space Ti 2 is two dimensional, 
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with orthogonal axes given by the spin-up state and the spin-down state w". If we view H, 2 
as an x-y plane, then the infinite temperature {(3 = 0) state is the intersection of the segment 
S]_ of the unit circle with the line y = x. As the temperature decreases, the equilibrium 
state moves towards u° or depending on the sign of the magnetic field h. 

The unnormalised thermal state i\) a in H 2 can be obtained by solving the equation 
dip a /df3 = -±H£4> b , with the result 

^ a (/3) = e/"» u « + e -t»> u a . ( 83 ) 

This can be normalised by dividing the right hand side by the partition function Q(j3) = 
2cosh(/3/i). We now project this space down to RP 1 , which is effectively a circle. Since a 
circle in H 2 is a double covering of RP 1 ~ S 1 , the two eigenstates are mapped to opposite 
points in RP 1 . 

Our geometric 'quantisation' procedure for this system is as follows. For the given ther- 
mal state ip a (P) we assign a phase factor, and thus obtain a quantum mechanical state 
space CP 1 , which is viewed as a sphere S 2 . The north and the south poles of the sphere 
correspond to the two energy eigenstates, and the great circles passing through these two 
points correspond to unitary equivalent thermal state space trajectories parameterised by 
the phase factor. The equator of the sphere, in particular, corresponds to the infinite tem- 
perature states. This circle is of course a 1-torus, in accordance with the general description 
of the infinite temperature state manifold given earlier. The Schrodinger dynamics is given 
by a rigid rotation of the sphere about the axis that passes through the north and the south 
poles, the two stationary points. This rotation gives rise to a Killing vector field on the 
sphere, where the angular velocity is given by the strength of the external field h. Thus, 
the temperature value specifies the latitude on the sphere and the Schrodinger evolution 
corresponds to a latitudinal circle. 

To pursue this in more detail we introduce a complex Hilbert space with coordinates Z a 
(a = 0, 1) which we regard as homogeneous coordinates for CP 1 . The complex conjugate 
of Z a is the 'plane' Z a in CP 1 , which in this dimension is simply a point. The point 
corresponding to Z a is then given by Z a = e a ^Zp, where e Q/3 is the natural symplectic form 
on the two-dimensional Hilbert space. The relevant formalism in this case is, of course, 
equivalent to the standard algebra of two-component spinors. In particular, by use of the 
spinor identity 2X^ a Y^ = e Q/3 X 7 F 7 , where X a e a(3 = X p , we obtain 

for the Fubini-Study metric in this situation, and 

Z^ = iH aP Z a Z^dt (85) 

for the projective Schrodinger equation. The Hamiltonian iff here has a symmetric repre- 
sentation of the form 

H a , ~ jp^y (86) 

where P a and P a correspond to the stationary points. In particular, we have H^P 13 = hP a 
and HgP 13 = —hP a , which follow if we bear in mind the identity P 7 P 7 = — P 7 P 7 . 
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By insertion of (j8~B|) into (|85| ) and then (^) into fl34|), we deduce that 

16/i 2 Prob[Z Q -> P Q ]Prob[Z Q -> P a ] , (87) 



' ds\ 2 



where Prob[Z a — > P a ] is the transition probability from Z Q to the north pole P Q , given by 
the cross ratio 

(^Z-Z/^Prp*) " 2 (1 + C ° S ^- (88) 

Here, 9 is the distance from Z a to P°, given by the usual angular coordinate measured 
down from the north pole. The complementary probability Prob[Z Q — > P a ] is given by 
|(1 — cos#), and it follows that the velocity of the trajectory through the state space is 

— = 2/isinfl , (89) 

Ct/t' 

a special case of the Anandan-Aharonov relation |23] noted above. For the evolutionary 
trajectory we obtain 

Z a = cos-e %{ht+ ^P a + sm-e- i{ht+(t,) P a , (90) 
2 2 v ; 

where 9 and <fi are the initial coordinates on the sphere for t = 0. If appropriate, Z a can be 
normalised by setting P a P a = 1. A short calculation then shows that the expectation E of 
the energy is given by E = hcos9, and that the variance of the energy is given by h 2 sin 2 9. 

Now, we are in a position to examine the primitive thermal trajectories associated with 
the spin one-half case. In this case the thermal equation is given by 

W ^ - fr* • (91) 

where H% = H$ - 5^H^Z v Z fl /Z"'Z T By use of flU) we then obtain 

c —~(3h+i(f> -pa I c \f5h—i4> pa 

7 a (Q9\ 

( e -ph + e 0hy/2 ' y J ^> 

where we assume the normalisation P a P a = 1. This shows that at infinite temperature 
(/3 = 0) the state lies on the equator, given by Z a = e i4 >P a + e~ i4, P a , where lies in the 
interval from to 2n. Zero temperature state is obtained by taking the limit (3 — - > oo, and 
we find that Z a approaches P a , the south pole, provided h > 0. For the expectation of the 
energy E = H^Z^Z a we have 

-f3h _ J3h 

E = h e -, h + e , h W 

which, as expected, ranges from to — h as j3 ranges from to oo. We note, in particular, 
that E is independent of the phase angle 0, and that the relation E = htanh(/3h) agrees 
with the result for a classical spin. 



27 



For other observables this is not necessarily the case, and we have to consider averaging 
over the random state Z a obtained by replacing with a random variable having a 
uniform distribution over the interval (0, 2ir). Then for the density matrix p% := E^L a 7i^\, 
where E[—] is the unconditional expectation, we obtain 



p -f3h pa p _ J3h pa p 



The fact that p% has trace unity follows from the normalisation condition Z a Z a = 1, and 
the identity Z a Z a = —Z a Z a . For the energy expectation p{H) = H^p^ we then recover 

(ED- 

Alternatively, we can consider the phase-space volume approach considered earlier, by 
assuming a microcanonical distribution for this system. Now, the phase space volume of 
the energy surface 8\ (a latitudinal circle) is given by V(E) = 2tt sin 9, where 9 is the 
angle measured from the pole of CP 1 ~ S 2 . Hence, by use of (p2|), along with the energy 
expectation E = hcos9, we deduce that the value of the system temperature is 

m = ^— . ( 9 5) 

Since E < and E 2 < h 2 , the inverse temperature (3 is positive. Furthermore, we see 
that E — implies 9 = tt/2, the equator of the sphere, which gives infinite temperature 
(P = 0), and E 2 = h 2 corresponds to 9 = tt, which gives the zero temperature (/3 = oo) 
state. The equation of state for this system can also be obtained by use of the standard 
relation /3p = OS/dV. In this case, we obtain the equation of state for an ideal gas, i.e., 
pV = P^ 1 . Explicitly, we have 

p(E) = - J^Vh 2 -E 2 . (96) 

The pressure is minimised when the spin aligns with the external field, and is maximised 
at the equator. We note that for positive energies E > the temperature takes negative 
values. If we take E < and then flip the direction of the external field h, this situation 
can be achieved in practice Although the concept of such a negative temperature is used 



frequently in the study of Laser phenomena, it is essentially a transient phenomenon [28 
and is thus not as such an objective of thermodynamics. 

We note, incidentally, that the distinct energy-temperature relationships obtained here 
in equation ( |9"5D for the canonical ensemble and in (^Sj) for the microcanonical ensemble, 
have qualitatively similar behaviour. Indeed, for a system consisting of a large number of 
particles these two results are expected to agree in a suitable limit. 



V. DISCUSSION 

The principal results of this paper are the following. First, we have formulated a pro- 
jective geometric characterisation for classical probability states. By specialising then to 
the canonical ensemble of statistical mechanics, we have been able to determine the main 
features of thermal trajectories, which are expressed in terms of a Hamiltonian gradient 
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flow. This flow is then shown to be a special case of a projective automorphism on the 
state space when it is endowed with the natural RP n metric. It should be clear that the 
same formalism, and essentially the same results, apply also to the grand canonical and the 
pressure-temperature distributions. 

The quantum mechanical dynamics of equilibrium thermal states can be studied by 
consideration of the Hopf-type map RP 2n+1 — > CP n , which in the present context allows 
one to regard the quantum mechanical state space as the base space in a fibre manifold 
which has the structure of an essentially classical thermal state space. The fact that a 
projective automorphism on a space of constant curvature can be decomposed into two 
distinct terms suggests the identification of the Killing term with the Schrodinger evolution 
of the Hamiltonian gradient flow with respect to the symplectic structure, and the other 
term with thermal evolution of the Hamiltonian gradient flow with respect to the metric — 
the former gives rise to a linear transformation, while the latter is nonlinear. 

There are a number of problems that still remain. First, much of our formulation has been 
based on the consideration of finite dimensional examples. The study of phase transitions, 
however, will require a more careful and extensive treatment of the infinite dimensional case. 
Our analysis of the projective automorphism group, on the other hand, suggests that the 
infinite dimensional case can also be handled comfortably within the geometric framework. 
Also, for most of the paper we have adopted the Schrodinger picture, which has perhaps 
the disadvantage of being inappropriate for relativistic covariance. It would be desirable to 
reformulate the theory in a covariant manner, in order to study the case of relativistic fields. 

Nevertheless, as regards the first problem noted above, the present formulation is suffi- 
ciently rich in order to allow us to speculate on a scenario for the spontaneous symmetry 
breaking of, say, a pure gauge group, in the infinite dimensional situation. In such cases 
the hypersurface of the parameter space (a curve in the one-parameter case considered here) 
proliferates into possibly infinite, thermally inequivalent hypersurfaces, corresponding to the 
multiplicity of the ground state degeneracy, at which the symmetry is spontaneously broken. 
The hyper-line characterising the proliferation should presumably be called the spinodal line 
(cf. [p9|), along which the Riemann curvature of the parameter space manifold is expected to 



diverge. Furthermore, a pure thermal state in the 'high temperature' region should evolve 
into a mixed state, obtained by averaging over all possible surfaces, by passing through 
the geometrical singularity (the spinodal boundary) characterising phase transitions. By a 
suitable measurement that determines which one of the ground states the system is in, this 
mixed state will reduce back to a pure state. In a cosmological context this proliferation 
may correspond, for example, to different vacuums It is interesting to note that the 



situation is analogous to the choice of a complex structure ||3T| for the field theory associated 
with a curved space-time, as the universe evolves. 

In any case, the remarkable advantages of the use of projective space should be stressed. 
As we have observed, the structure of projective space allows us to identify probabilistic 
operations with precise geometric relations. One of the problems involved in developing 
nonlinear (possibly relativistic) generalisations of quantum mechanics concerns their prob- 
abilistic interpretation. Formulated in a projective space, such generalisations can be ob- 
tained, for example, by replacing the Hamiltonian function by a more general function, or 
by introducing a more general metric structure. In this way, the assignment of a suitable 
probability theory can be approached in an appropriate way. In particular, as we have ob- 
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served, the canonical ensemble of statistical mechanics has an elegant characterisation in 
projective space — but this is an example of a theory that is highly nonlinear and yet purely 
probabilistic. 

It is also interesting to observe that the nonlinear generalisation of quantum mechanics 
considered by Kibble || and others can be applied to the thermal situation, in the sense that 
Hamiltonian function defined on the state manifold can be replaced by a general observ- 
able. For such generalisations it is not clear what physical interpretation can be assigned. 
Naively, one might expect that by a suitable choice of an observable the resulting trajectory 
characterises some kind of nonequilibrium process. 

One of the goals of this paper has been to formulate quantum theory at finite temperature 
in such a way as to allow for the possibility of various natural generalisations. These might 
include, for example, the stochastic approach to describe measurement theory, or nonlinear 
relativistic extensions of standard quantum theory, as noted above. These generalisations 
will be pursued further elsewhere. 

Acknowledgement. DCB is grateful to Particle Physics and Astronomy Research Council 
for financial support. 

* Electronic address: d.brody@damtp.cam.ac.uk 
f Electronic address: lane@ml.com 

1. Haag, R. and Kastler, D., J. Math. Phys. 5, 848 (1964). 

2. Landsmann, N.P., Aspects of Classical and Quantum Mechanics (Springer, New York 
1998). 

3. Kibble, T.W.B., Commun. Math. Phys. 65, 189 (1979). 

4. Gibbons, G.W., J. Geom. Phys. 8, 147 (1992). 

5. Hughston, L.P., in Twistor Theory, ed. Huggett, S. (Marcel Dekker, Inc., New York 
1995). 

6. Brody, D.C. and Hughston, L.P., "Statistical Geometry", kr-qc/9701051 . 



7. Brody, D.C. and Hughston, L.P., "Geometry of Thermodynamic States", |quant- 
ph/9706030 . 



8. Brody, D.C. and Hughston, L.P, Phys. Rev. Lett. 77, 2851 (1996). 

9. Gibbons, G.W., Class. Quant. Grav. 14, A155 (1997). 

10. Haag, R., Hugenholtz, N.M. and Winnink, M., Commun. math. Phys. 5, 215 (1967). 

11. Xia, D.X., Measure and Integration Theory on Infinite- Dimensional Spaces (Academic 
Press, New York 1972). 

12. Mandelbrot, B., Ann. Math. Stat. 33, 1021 (1962); Phys. Today, (January 1989). 

13. Feshbach, H., Phys. Today, (November 1987); Kittel, C, Phys. Today, (May 1988). 



30 



14. Ross, S., Stochastic Process, (Wiley, New York 1996). 

15. Kibble, T.W.B., Commun. Math. Phys. 64, 239 (1978). 

16. Weinberg, S., Phys. Rev. Lett. 62, 485 (1989). 

17. Gisin, N. and Percival, I., Phys. Lett. A167, 315 (1992). 

18. Hughston, L.P, Proc. Roy. Soc. London 452, 953 (1996). 

19. Hughston, L.P. and Hurd, T.R., Phys. Rep. 100, 273 (1983). 

20. Kobayashi, S. and Nomizu, K. Foundations of differential geometry, vols. 1 and 2 
(Wiley, New York, 1963 and 1969). 

21. Tomonaga, Y., Riemannian Geometry, (Kyoritsu Publishing Co., Tokyo 1970). 

22. Hughston, L.P. and Tod, K.P., An Introduction to General Relativity, (Cambridge 
University Press, Cambridge 1990). 

23. Anandan, J. and Aharonov, Y., Phys. Rev. Lett. 65, 1697 (1990). 

24. Ashtekar, A. and Schilling, T.A., "Geometrical Formulation of Quantum Mechanics", 
gr-qc/9706069 . 

25. Bratteli, O. and Robinson, D.W., Operator algebras and quantum statistical mechanics, 
vols. I and II (Springer, New York 1979,1981). 

26. Ruelle, D., Statistical Mechanics: Rigorous Results (Addison- Wesley, New York 1989). 

27. Thompson, C.J., Mathematical Statistical Mechanics (Princeton University Press, 
Princeton 1972). 

28. Kubo, R., Statistical Mechanics, (Kyoritsu Publishing Co., Tokyo 1951). 

29. Brody, D. and Rivier, N., Phys. Rev. E 51, 1006 (1995). 

30. Sacha, R.G., Phys. Rev. Lett. 78, 420 (1997). 

31. Gibbons, G.W. and Pohle, H.J., Nucl. Phys. B410, 117 (1993). 



31 



